RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY SYSTEMS BRANCH




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable
form:

Application Serial Number: 09/807,933



Source: 



Date Processed by STIC: 



- 7~ Of 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER:
1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE
APPLICANT, WITH A NOTICE TO COMPLY or,
2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A
NOTICE TO COMPLY

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212


PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW.



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker and is Y2K
compliant. Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker 
RAW SEQUENCE LISTING

PATENT APPLICATION US/09/807,933 



DATE: 05/07/2001 
TIME: 13:40:33 



Input Set: 1807933.RAW



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 



<110> MEIJI SEIKA KAISHA, LTD.

<120> Endoglucanase and cellulase composition containing the
same

<130> 121659PX 

<140> US/09/807,933 

<141> 2001-04-20 

<150> JP302387/1998

<151> 1998-10-23

<160> 113

<170> PatentIn Ver. 2.0



Does Not Comply
Corrected Diskette Needed



ERRORED SEQUENCES FOLLOW 



11 <210> 1
E--> 12 <211>
13 <212> PRT
14 <213> Rhizopus oryzae CP96001
15 <220>
16 <221> sig_peptide
17 <222> (-23)...(-1)
18 <221> mat_peptide
19 <222> (1)...(315)
20 <400> 1
21 Met Lys Phe Ile Thr Ile Ala Ser Ser Ala Leu Leu Ala Leu Ala Leu
22 -20 -15 -10
23 Gly Thr Glu Met Ala Ser Ala Ala Glu Cys Ser Lys Leu Tyr Gly Gln
24 -5 1 5
25 Cys Gly Gly Lys Asn Trp Asn Gly Pro Thr Cys Cys Glu Ser Gly Ser
26 10 15 20 25
27 Thr Cys Lys Val Ser Asn Asp Tyr Tyr Ser Gln Cys Leu Pro Ser Gly
28 30 35 40
29 Ser Ser Gly Asn Lys Ser Ser Glu Ser Ala His Lys Lys Thr Thr Thr
30 45 50 55
31 Ala Ala His Lys Lys Thr Thr Thr Ala Ala His Lys Lys Thr Thr Thr
32 60 65 70
33 Ala Pro Ala Lys Lys Thr Thr Thr Val Ala Lys Ala Ser Thr Pro Ser
34 75 80 85
35 Asn Ser Ser Ser Ser Ser Ser Gly Lys Tyr Ser Ala Val Ser Gly Gly
36 90 95 100 105
37 Ala Ser Gly Asn Gly Val Thr Thr Arg Tyr Trp Asp Cys Cys Lys Ala
38 110 115 120
39 Ser Cys Ser Trp Pro Gly Lys Ala Asn Val Ser Ser Pro Val Lys Ser
40 125 130 135
41 Cys Asn Lys Asp Gly Val Thr Ala Leu Ser Asp Ser Asn Ala Gln Ser
42 140 145 150
43 Gly Cys Asn Gly Gly Asn Ser Tyr Met Cys Asn Asp Asn Gln Pro Trp
44 155 160 165
45 Ala Val Asn Asp Asn Leu Ala Tyr Gly Phe Ala Ala Ala Ala Ile Ser
46 170 175 180 185
47 Gly Gly Gly Glu Ser Arg Trp Cys Cys Ser Cys Phe Glu Leu Thr Phe
48 190 195 200
49 Thr Ser Thr Ser Val Ala Gly Lys Lys Met Val Val Gln Val Thr Asn
50 205 210 215
51 Thr Gly Gly Asp Leu Gly Ser Ser Thr Gly Ala His Phe Asp Leu Gln
52 220 225 230
53 Met Pro Gly Gly Gly Val Gly Ile Phe Asn Gly Cys Ser Ser Gln Trp
54 235 240 245
55 Gly Ala Pro Asn Asp Gly Trp Gly Ser Arg Tyr Gly Gly Ile Ser Ser
56 250 255 260 265
57 Ala Ser Asp Cys Ser Ser Leu Pro Ser Ala Leu Gln Ala Gly Cys Lys
58 270 275 280
59 Trp Arg Phe Asn Trp Phe Lys Asn Ala Asp Asn Pro Ser Met Thr Tyr
60 285 290 295
61 Lys Glu Val Thr Cys Pro Lys Glu Ile Thr Ala Lys Thr Gly Cys Ser
62 300 305 310
63 Arg Lys
64 315
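
The calculated length reported for this sequence can be cross-checked by hand from its feature table. The sketch below is only an illustration of that arithmetic, not the Checker program's actual logic (which this report does not describe). It derives the length implied by the two <222> ranges, (-23)...(-1) and (1)...(315), and counts three-letter residue codes while ignoring line numbers and numbering lines. All function names are hypothetical.

```python
# Hypothetical helpers for cross-checking a protein listing by hand.
AMINO_ACIDS = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}


def implied_length(first_position, last_position):
    """Residue count for numbering that starts at a negative signal-peptide
    position and ends at a positive mature-peptide position.
    There is no position 0, so the span is simply last - first."""
    if not (first_position < 0 < last_position):
        raise ValueError("expects a negative start and a positive end")
    return last_position - first_position


def count_residues(lines):
    """Count three-letter codes; line numbers and numbering lines add nothing."""
    return sum(1 for line in lines for tok in line.split() if tok in AMINO_ACIDS)


def position_label(index, first_position):
    """Map a 0-based residue index to its listing number, skipping 0."""
    label = first_position + index
    return label if label < 0 else label + 1


if __name__ == "__main__":
    print(implied_length(-23, 315))   # 23 signal + 315 mature residues
    tail = [
        "61 Lys Glu Val Thr Cys Pro Lys Glu Ile Thr Ala Lys Thr Gly Cys Ser",
        "62 300 305 310",
        "63 Arg Lys",
    ]
    print(count_residues(tail))
    print(position_label(23, -23))    # first mature residue
```

For this sequence the implied length is 23 + 315 = 338, which agrees with the calculated sequence length shown for line 12 in the verification summary.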



67 <212> PRT

68 <213> Mucor circinelloides CP99001

69 <220>

70 <221> sig_peptide

71 <222> (-22)...(-1)

72 <221> mat_peptide

73 <222> (1)...(316)

74 <400> 7

75 Met Lys Phe Thr Val Ala Ile Thr Ser Ile Ala Val Ala Leu Ala Leu

E--> 78

79 Ser Ser Ser Ala Glu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gln

E--> 80 -5 0 1 5 10

81 Gly Gly Ile Gly Trp Ser Gly Pro Thr Cys Cys Glu Ser Gly Ser

82 15 20 25

83 Cys Val Ala Gln Glu Gly Asn Lys Tyr Tyr Ser Gln Cys Leu Pro

84 30 35 40

85 Ser His Ser Asn Asn Ala Gly Asn Ala Ser Ser Thr Lys Lys Thr

86 45 50 55

87 Thr Lys Thr Ser Thr Thr Thr Ala Lys Ala Thr Ala Thr Val Thr

88 60 65 70




89 Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Thr Thr Lys Thr Ser

90 75 80 85 90

91 Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Gly Tyr Lys

92 95 100 105

93 Ile Ser Gly Gly Lys Ser Gly Ser Gly Ser Thr Thr Arg Tyr Trp

94 110 115 120

95 Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Val Thr

96 125 130 135

97 Pro Val Asp Thr Cys Ala Ser Asn Gly Ile Ser Leu Leu Asp Ala

98 140 145 150

99 Ala Gln Ser Gly Cys Asn Gly Gly Asn Gly Phe Met Cys Asn Asn

100 155 160 165 170

101 Gln Pro Trp Ala Val Asn Asp Glu Leu Ala Tyr Gly Phe Ala Ala

102 175 180 185

103 Ser Ile Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Gly Cys Tyr

104 190 195 200

105 Leu Thr Phe Thr Ser Gly Ala Ala Ser Gly Lys Lys Met Val Val

106 205 210 215

107 Val Thr Asn Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp Leu

108 220 225 230

109 Met Pro Gly Gly Gly Val Gly Ile Phe Asn Gly Cys Ala Ala Gln

110 235 240 245 250

111 Gly Ala Pro Asn Asp Gly Trp Gly Ala Arg Tyr Gly Gly Val Ser

112 255 260 265

113 Val Ser Asp Cys Ala Ser Leu Pro Ser Ala Leu Gln Ala Gly Cys

114 270 275 280

115 Trp Arg Phe Asn Trp Phe Lys Asn Ser Asp Asn Pro Thr Met Thr

116 285 290 295

117 Lys Glu Val Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly Cys

118 300 305 310

119 Arg

120 315



121 <210> 8

E--> 122 <211> 1017

123 <212> DNA

124 <213> Mucor circinelloides CP99001

125 <220>

126 <221> sig_peptide

127 <222> (1)...(66)

128 <221> mat_peptide

129 <222> (67)...(1017)

130 <400> 8

131 atg aag ttc acc gtt gct att act tca atc gct gtt gca ctc gct ctc 48

132 Met Lys Phe Thr Val Ala Ile Thr Ser Ile Ala Val Ala Leu Ala
W--> 133

E--> 134 @@@@-20@@@@@@@@ -15 -10

135 agc tct tct gct gaa gct gct tct tgc agc tct gtc tat ggt caa tgt 96

136 Ser Ser Ser Ala Glu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gln
W--> 137 -5 @ 1 5 10



138 ggt ggc att gga tgg agt gga cct acc tgt tgt gaa agt ggc tct act 144
139 Gly Gly Ile Gly Trp Ser Gly Pro Thr Cys Cys Glu Ser Gly Ser
140 15 20 25
141 tgc gtt gct caa gaa ggc aac aaa tac tac tct caa tgt ctt ccc gga 192
142 Cys Val Ala Gln Glu Gly Asn Lys Tyr Tyr Ser Gln Cys Leu Pro
143 30 35 40
144 tcc cac agt aac aat gct ggt aac gct agc agc acc aag aag aca tct 240
145 Ser His Ser Asn Asn Ala Gly Asn Ala Ser Ser Thr Lys Lys Thr
146 45 50 55
147 acc aag aca tct act acc acc gcc aag gct act gct act gtc acc acc 288
148 [first residues illegible in scan] Ala Lys Ala Thr Ala Thr Val Thr
149 60 65 70
150 aag aca gta acc aag aca act acc aag aca act acc aag act agc act 336
151 Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Thr Thr Lys Thr Ser
152 75 80 85 90
153 act gcc gct gct tct act tcc acc tct tct tct gct ggt tac aag gtc 384
154 Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Gly Tyr Lys
155 95 100 105
156 atc tct ggc ggt aaa tct ggc agt ggt tcc aca act cgt tat tgg gat 432
157 Ile Ser Gly Gly Lys Ser Gly Ser Gly Ser Thr Thr Arg Tyr Trp
158 110 115
159 tgt tgt aaa gct tct tgc agc tgg cct gga aaa gct tct gtc act ggt 480
160 Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Val Thr
161 125 130 135
162 cct gtt gac acc tgt gcc tcc aat ggt atc tct tta tta gat gcc aat 528
163 Pro Val Asp Thr Cys Ala Ser Asn Gly Ile Ser Leu Leu Asp Ala
164 140 145 150
165 gct caa agt ggt tgt aac ggt ggt aat ggt ttc atg tgt aac aac aac 576
166 Ala Gln Ser Gly Cys Asn Gly Gly Asn Gly Phe Met Cys Asn Asn
167 155 160 165 170
168 caa cct tgg gct gtc aat gat gag ctc gct tac ggt ttc gct gct gcc 624
169 Gln Pro Trp Ala Val Asn Asp Glu Leu Ala Tyr Gly Phe Ala Ala
170 175 180 185
171 tct att gct ggc tcc aac gaa gct gga tgg tgt tgt ggc tgt tat gaa 672
172 Ser Ile Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Gly Cys Tyr
173 190 195 200
174 ttg acc ttc act tct ggc gct gct tct gga aag aag atg gtt gtt caa 720
175 Leu Thr Phe Thr Ser Gly Ala Ala Ser Gly Lys Lys Met Val Val
176 205 210 215
177 gtt acc aac acc ggt ggc gat tta ggc tct aac cac ttt gat ttg caa 768
178 Val Thr Asn Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp Leu
179 220 225 230
180 atg ccc ggt ggt ggc gtt ggt atc ttc aat ggc tgt gct gct caa tgg 816
181 Met Pro Gly Gly Gly Val Gly Ile Phe Asn Gly Cys Ala Ala Gln
182 235 240 245 250
183 ggc gct ccc aat gat ggc tgg gga gct aga tat ggt ggt gtc agc tct 864
184 Gly Ala Pro Asn Asp Gly Trp Gly Ala Arg Tyr Gly Gly Val Ser
185 255 260 265
186 gtc tct gac tgt gcc tct ctt ccc tct gct ctt caa gct ggt tgt aaa 912
187 Val Ser Asp Cys Ala Ser Leu Pro Ser Ala Leu Gln Ala Gly Cys



188 270 275 280

189 tgg aga ttc aac tgg ttc aag aac tct gat aac cct acc atg acc ttc 960

190 Trp Arg Phe Asn Trp Phe Lys Asn Ser Asp Asn Pro Thr Met Thr

191 285 290 295

192 aag gaa gtt acc tgt cct gct gaa tta act act cgc tca ggt tgc gaa 1008

193 Lys Glu Val Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly Cys

194 300 305 310

195 aga aag taa 1017

196 Arg

197 315
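
Each codon line in a listing like this one can be checked against the residue line printed beneath it. The sketch below is a hedged illustration of that check using the standard genetic code; it is not taken from the Checker program, and the helper names (and the "Ter" label for a stop codon) are assumptions made for the example.

```python
# Standard genetic code, built from the conventional t/c/a/g ordering.
BASES = "tcag"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_1 = {
    a + b + c: AMINO_1[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val",
    "*": "Ter",  # label chosen for this example only
}


def codons(dna_line):
    """Return the three-letter codon tokens, dropping the running total."""
    return [t.lower() for t in dna_line.split() if t.isalpha() and len(t) == 3]


def translate(dna_line):
    """Translate one codon line into three-letter residue codes."""
    return [ONE_TO_THREE[CODON_TO_1[c]] for c in codons(dna_line)]


def mismatches(dna_line, residue_line):
    """List (index, expected, listed) for residues that disagree with the codons."""
    listed = residue_line.split()
    return [(i, e, l) for i, (e, l) in enumerate(zip(translate(dna_line), listed)) if e != l]


if __name__ == "__main__":
    row = "atg aag ttc acc gtt gct att act tca atc gct gtt gca ctc gct ctc 48"
    print(" ".join(translate(row)))
    print(3 * len(codons(row)))                   # bases on this line
    print(mismatches(row, "Met Lys Ph Thr Val"))  # deliberately damaged residue
```

The same base count, accumulated line by line, is what the running totals at the right-hand margin (48, 96, 144, ...) should equal.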



198 <210> 10

E--> 199 <211> 1164

200 <212> DNA

201 <213> Mucor circinelloides CP99001

202 <220>

203 <221> sig_peptide

204 <222> (1)...(66)

205 <221> mat_peptide

206 <222> (67)...(1164)

207 <400> 10

208 atg aag ttc acc gtt gct att act tca atc gct gtt gca ctc gct ctc 48
209 Met Lys Phe Thr Val Ala Ile Thr Ser Ile Ala Val Ala Leu Ala

210 -20 -15 -10

211 agc tct tct gct gaa gct gct tct tgc agc tct gtc tat ggt caa tgt 96

212 Ser Ser Ser Ala Glu Ala Ala Ser Cys Ser Ser Val Tyr Gly Gln

213 -5 1 5 10

214 ggt ggc att ggc tgg act ggt cct aca tgt tgt gat gct gga tcg acc 144

215 Gly Gly Ile Gly Trp Thr Gly Pro Thr Cys Cys Asp Ala Gly Ser

216 15 20 25

217 tgt aaa gct caa aag gat aac aaa tat tat tct caa tgt att ccc aaa 192

218 Cys Lys Ala Gln Lys Asp Asn Lys Tyr Tyr Ser Gln Cys Ile Pro

219 30 35 40

220 ccc aag ggt tcc tcc tca tca tca tca tgt agt tcc gtc tat agt caa 240

221 Pro Lys Gly Ser Ser Ser Ser Ser Ser Cys Ser Ser Val Tyr Ser

222 45 50 55

223 tgc ggt ggc att gga tgg agt gga cct acc tgt tgt gaa agt ggc tct 288

224 Cys Gly Gly Ile Gly Trp Ser Gly Pro Thr Cys Cys Glu Ser Gly

225 60 65 70

226 act tgc gtt gct caa gaa ggc aac aaa tac tac tct caa tgt ctt ccc 336

227 Thr Cys Val Ala Gln Glu Gly Asn Lys Tyr Tyr Ser Gln Cys Leu

228 75 80 85 90

229 gga tcc cac agt aac aat gct ggt aac gct agc agc acc aag aag aca 384

230 Gly Ser His Ser Asn Asn Ala Gly Asn Ala Ser Ser Thr Lys Lys

231 95 100 105

232 tct acc aag aca tct act acc acc gcc aag gct act gct act gtc acc 432

233 Ser Thr Lys Thr Ser Thr Thr Thr Ala Lys Ala Thr Ala Thr Val

234 110 115 120

235 acc aag aca gta acc aag aca act acc aag aca act acc aag act agc 480

236 Thr Lys Thr Val Thr Lys Thr Thr Thr Lys Thr Thr Thr Lys Thr



237 125
238 [codon line illegible in scan] 528
239 Thr Thr Ala Ala Ala Ser Thr Ser Thr Ser Ser Ser Ala Gly Tyr
240 140 145 150
241 [codon line illegible in scan] 576
242 Val Ile Ser Gly Gly Lys Ser Gly Ser Gly Ser Thr Thr Arg Tyr
243 155 160 165
244 [?] tgt tgt aaa gct tct tgc agc tgg cct gga aaa gct tct gtc act 624
245 Asp Cys Cys Lys Ala Ser Cys Ser Trp Pro Gly Lys Ala Ser Val
246 175 180 185
247 ggt cct gtt gac acc tgt gcc tcc aat ggt atc tct tta tta gat gcc 672
248 Gly Pro Val Asp Thr Cys Ala Ser Asn Gly Ile Ser Leu Leu Asp
249 190 195 200
250 aat gct caa agt ggt tgt aac ggt ggt aat ggt ttc atg tgt aac aac 720
251 Asn Ala Gln Ser Gly Cys Asn Gly Gly Asn Gly Phe Met Cys Asn
252 205 210 215
253 aac caa cct tgg gct gtc aat gat gag ctc gct tac ggt ttc gct gct 768
254 Asn Gln Pro Trp Ala Val Asn Asp Glu Leu Ala Tyr Gly Phe Ala
255 220 225 230
256 gcc tct att gct ggc tcc aac gaa gct gga tgg tgt tgt ggc tgt tat 816
257 Ala Ser Ile Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Gly Cys
258 240 245 250
259 [codon line illegible in scan] 864
260 Glu Leu Thr Phe Thr Ser Gly Ala Ala Ser Gly Lys Lys Met Val
261 255 260 265
262 caa gtt acc aac acc ggt ggc gat tta ggc tct aac cac ttt gat ttg 912
263 Gln Val Thr Asn Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp
264 270 275 280
265 caa atg ccc ggt ggt ggc gtt ggt atc ttc aat ggc tgt gct gct caa 960
266 Gln Met Pro Gly Gly Gly Val Gly Ile Phe Asn Gly Cys Ala Ala
267 285 290 295
268 [codon line illegible in scan] 1008
269 Trp Gly Ala Pro Asn Asp Gly Trp Gly Ala Arg Tyr Gly Gly Val
270 300 305 310
271 [codon line illegible in scan] 1056
272 [residue line illegible in scan]
273 315
274 [codon line illegible in scan] 1104
275 Lys Trp Arg Phe Asn Trp Phe Lys Asn Ser Asp Asn Pro Thr Met
276 335 340 345
E--> 277 ttc aag gaa gtt acc tgt cct gct gaa tta act act cgc tca ggt tgc 1152
W--> 278 Phe
279 Glu Val Thr Cys Pro Ala Glu Leu Thr Thr Arg Ser Gly
280 350 355 360
E--> 281 gaa aga aag taa 1164
282 Glu Arg
283 365

284 <210> 12
E--> 285 <211> 1041



286 <212> DNA
287 <213> Phycomyces nitens CP99002
288 <220>
289 <221> sig_peptide
290 <222> (1)...(57)
291 <221> mat_peptide
292 <222> (58)...(1041)
293 <400> 12
294 atg aag ttc tcc atc atc gct tcc gcc ctt ctc ctc gct gcc agc tcc 48
295 Met Lys Phe Ser Ile Ile Ala Ser Ala Leu Leu Leu Ala Ala Ser
296 -15 -10 -5
297 act tac gct gct gaa tgc agc caa ggc tat ggc cag tgt ggt ggc aag 96
298 Thr Tyr Ala Ala Glu Cys Ser Gln Gly Tyr Gly Gln Cys Gly Gly
299 1 5 10
300 atg tgg act ggt ccc acc tgc tgc acc tcc ggc ttc acc tgt gta ggt 144
301 Met Trp Thr Gly Pro Thr Cys Cys Thr Ser Gly Phe Thr Cys Val
302 15 20 25
303 gcc gaa aac aac gag tgg tac tct cag tgt atc ccc aac gat caa gtc 192
304 Ala Glu Asn Asn Glu Trp Tyr Ser Gln Cys Ile Pro Asn Asp Gln
305 30 35 40 45
306 cag ggt aac ccc aag acc acc acc acc acc acc acc aag gct gcc act 240
307 Gln Gly Asn Pro Lys Thr Thr Thr Thr Thr Thr Thr Lys Ala Ala
308 50 55 60
309 acc acc aag gct cct gtc acc acc acc aag gcc acc acc acc acc acc 288
310 Thr Thr Lys Ala Pro Val Thr Thr Thr Lys Ala Thr Thr Thr Thr
311 65 70 75
312 acc aag gcc cct gtc acc acc acc aag gcc act act act acc acc acc 336
313 Thr Lys Ala Pro Val Thr Thr Thr Lys Ala Thr Thr Thr Thr Thr
W--> 314 [illegible in scan]
E--> 315 @@@@ 80 85 90
316 aag acc acc acc aag acc acc acc acc aag gct gcc acc acc acc tcc 384
317 Lys Thr Thr Thr Lys Thr Thr Thr Thr Lys Ala Ala Thr Thr Thr
318 95 100 105
319 tct tcc aac act ggc tac agc ccc att tct ggt ggc ttc tct gga aac 432
320 Ser Ser Asn Thr Gly Tyr Ser Pro Ile Ser Gly Gly Phe Ser Gly
321 110 115 120 125
322 ggt cgc act acc cgc tac tgg gat tgc tgc aag ccc tct tgc gcc tgg 480
323 Gly Arg Thr Thr Arg Tyr Trp Asp Cys Cys Lys Pro Ser Cys Ala
324 130 135 140
325 gac gga aag gct tct gta act aag cct gta ctc acc tgt gcc aag gat 528
326 Asp Gly Lys Ala Ser Val Thr Lys Pro Val Leu Thr Cys Ala Lys
327 145 150 155
328 ggt gtc agc cgt ctc ggt tcc gat gtc cag agc ggt tgc gtc ggc ggc 576
329 Gly Val Ser Arg Leu Gly Ser Asp Val Gln Ser Gly Cys Val Gly
330 160 165 170
331 cag gcc tac atg tgc aat gac aac cag ccc tgg gtt gtc aat gac gac 624
332 Gln Ala Tyr Met Cys Asn Asp Asn Gln Pro Trp Val Val Asn Asp
333 175 180 185
334 ctt gcc tac ggt ttc gct gct gcc agt ctc ggt agc gcc ggt gcc tct 672
335 Leu Ala Tyr Gly Phe Ala Ala Ala Ser Leu Gly Ser Ala Gly Ala
336 190 195 [...]
337 gca ttc tgc tgc [?] tgt tac gag ctt [...] 720
338 Ala Phe Cys Cys Gly Cys Tyr Glu Leu [...]
339 210 [...]
340 gct ggc [?] aag aag ttt atc atc cag gtc [...] 768
341 Ala Gly Lys Lys Phe Val Val Gln Val [...]
342 225 230 [...]
343 agc acc aac cac ttt gat tta cag atg [...] 816
344 Ser Thr Asn His Phe Asp Leu Gln Met [...]
345 240 [?] [...]
346 ttc aac ggc tgc caa tcc [?] tgg aac [...] 864
347 Phe Asn Gly Cys Gln Ser Gln Trp Asn [...]
348 255 260 [...]
349 gct cgc [?] tat ggc ggt att aac [?] tct att [...] 912
350 Ala [?] Arg Tyr Gly [?] Ser Ile [...]
351 270 275 [...]
352 acc cag ttg cag gct ggt tgc aag tgg [...] 960
353 Thr Gln Leu Gln Ala Gly Cys Lys Trp [...]
354 290 [...]
E--> 355 gct gac aac cca gag gtc acc ttc aag [...] gag 1008
W--> 356
357 Asn Pro Glu Val Thr Phe Lys Ala Val [...]
358 305 310 [...]
359 atc [?] att [?] aag act ggt tgc gag cgc [...] 1041
360 Ile Ile Ala Lys Thr Gly Cys Glu Arg [...]
361 320 325 [...]

[Right-hand column fragments of lines 336-361, in scan order; their row positions are not recoverable from the scan:]
200; acc ttc / Thr Phe; 215; acc aac / Thr Asn; ccc ggc / Pro Gly; acc aac / Thr Asn;
acc aac / Thr Asn; acc ggt / Thr Gly; Ser; gag / Glu; 280; ggt / Gly; acc / Thr; 265;
tgc / Cys; ggt / Gly; 250; gat / Asp; gac / Asp; act / Thr; gat / Asp; 235; gtc / Val;
gct / Ala; 220; gat / Asp; ggc / Gly; 205; gtc; ggc tgg / Gly Trp; aag ctc / Lys Leu; 295;
gga tgg ttc aag / Gly Trp Phe Lys; 300; act tgc cct gcc; ctc; tac; ggt; cct; 285; aac; 315

362 <210> 51
E--> 363 <211> 22
364 <212> DNA
365 <213> Artificial Sequence
366 <220>
367 <223> Description of Artificial Sequence: primer
368 <400> 51
369 cagagggaag acattgagag tag




VERIFICATION SUMMARY

PATENT APPLICATION US/09/807,933    DATE: 05/07/2001    TIME: 13:40:33

Input Set: 1807933.RAW

Line Error/Warning                                  Original Text

12   E  Input 328, Calc Seq. Length 338 differ      <211> 323
66   E  Input 338, Calc Seq. Length 339 differ      <211> 338
76   E  Invalid/Missing Amino Acid Numbering        @@@@-20@@@@@@@@ -15 -10
77   E  Wrong Amino Acid Designator
78   E  Invalid/Missing Amino Acid Numbering
80   E  Invalid/Missing Amino Acid Numbering
122  E  Input 1017, Calc# Bases 1032 differ         <211> 1017
133  W  Invalid/Missing Amino Acid Numbering
134  E  Number of Bases conflict w/ Running Total   @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
134  E  Wrong Nucleic Acid Designator               @@@@-20@@@@@@@@ -15 -10
137  W  Invalid/Missing Amino Acid Numbering        -5 @ 1
199  E  Input 1164, Calc# Bases 1116 differ         <211> 1164
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
277  E  Wrong Amino Acid Designator                 ttc aag gaa gtt acc tgt cct gct gaa tta a
278  W  Invalid/Missing Amino Acid Numbering
281  E  Number of Bases conflict w/ Running Total   gaa aga aag taa
285  E  Input 1041, Calc# Bases 997 differ          <211> 1041
314  W  Invalid/Missing Amino Acid Numbering



315  E  Number of Bases conflict w/ Running Total   @@@@ 80 85
315  E  Wrong Nucleic Acid Designator               @@@@ 80 85
315  E  Wrong Nucleic Acid Designator               @@@@ 80 85
315  E  Wrong Nucleic Acid Designator               @@@@ 80 85
315  E  Wrong Nucleic Acid Designator               @@@@ 80 85
355  E  Wrong Amino Acid Designator                 gct gac aac cca gag gtc acc ttc aag gct g
        [entry repeated for line 355; the repeat count is not legible in the scan]
356  W  Invalid/Missing Amino Acid Numbering
363  E  Input 22, Calc# Bases 23 differ             <211> 22
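
The "Input ..., Calc# Bases ... differ" entries compare the length declared under <211> with the number of bases actually present under <400>. The sketch below reproduces that comparison for the primer of sequence 51; it is an assumption-laden illustration rather than the Checker program's code, and the function names and the imitation of the message format are hypothetical.

```python
def count_bases(sequence_lines):
    """Count nucleotide letters, ignoring spacing and numeric running totals."""
    return sum(
        len(token)
        for line in sequence_lines
        for token in line.split()
        if token.isalpha()
    )


def length_report(declared, sequence_lines):
    """Return a message in the style of the summary, or None if lengths agree."""
    calculated = count_bases(sequence_lines)
    if calculated == declared:
        return None
    return f"Input {declared}, Calc# Bases {calculated} differ"


if __name__ == "__main__":
    primer = ["cagagggaag acattgagag tag"]   # <400> 51 as printed in the listing
    print(count_bases(primer))
    print(length_report(22, primer))         # <211> 22 was declared
    print(length_report(23, primer))
```

Counting the primer gives 10 + 10 + 3 = 23 bases, so a declared <211> of 22 is reported as a mismatch, while a declared 23 would pass.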



